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5 What is claimed is: 

. A method of developing a domain-specif ic analytic 
application having at least one predefined data mining 
aodel, the method comprising the steps of: 
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15 



30 




identifying a business problem to be solved; 

selecting a data mining algorithm appropriate for 
solving the. business problem; 

defining Mata schema for use as inputs and outputs 
to and from the\mining algorithm, the data schema 
including input data schema and output data schema; and 



20 defining a datav mining model dependent upon the data 

schema, defining a da\a mining model resulting in the 
creation of a predefined data mining model; 

whereby a domain- specific analytic application is 
25 developed, the analytic application having at least one 
predefined data mining model^ 

2. The method of claim 1 whereir^the analytic application 
comprises the capabilities of: 



production training the predefined data mining model 
using the historical data in the input data schema, 
wherein use of the capability of production training the 
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predefined data mining model results in creation of a 
production trained data mining model; and 

production scoring production data by use of the 
5 prodiXption trained data mining model. 

3. The method of claim 2 wherein the capability of 
production training the predefined data mining model 
further comprises the capability of operating the 

10 predef ined\data mining model in training mode using end 
user historical data in the input data schema. 

4. The method of claim 2 wherein the capability of 
production scoring production data by use of the 

15 production trained data mining model further comprises 
the capability of\applying the production trained data 
mining model to historical data stored in input schema. 

5. The method of claii^i 2 wherein the capability of 
20 production scoring production data by use of the 

production trained dat^ mining model further comprises 
the capability of applying the production trained data 
mining model to productio\i data stored read from an end 
user's production databases 

25 

6. The method of claim 1 whertein the analytic application 
further comprises the capability of populating the input 
data schema with historical dat 

30 7. The method of claim 6 wherein Vhe capability of 

populating the input data schema with historical data 
further comprises the capabilities iof extracting from 
historical data values of prediction^ data fields and 
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writing the values of the prediction data fields into the 
inpiat data schema for the data mining model. 

8, Theynethod of claim 1 wherein identifying a business 
5 problenK to be solved further comprises identifying a 

a usineifes problem capable of expression through the use 
of refererits that are defined in a specific computational 
domain. \ 

10 9. The method W claim 1 wherein selecting a mining 

algorithm appropriate for solving the business problem 
further comprises^ selecting a radial basis function 
algorithm for value prediction. 

15 10. The method of cJSaim 1 wherein selecting a mining 
algorithm appropriate ^or solving the business problem 
further comprises selecting a neural value prediction 
algorithm. \ 

20 11. The method of claim 1 wherein selecting a mining 
algorithm appropriate for solVing the business problem 
further comprises selecting a oemographic clustering 
algorithm. \ 

25 12. The method of claim 1 whereinNselecting a mining 
algorithm appropriate for solving tnte business problem 
further comprises selecting a neural flustering 
algorithm. \ 

30 13 . The method of claim 1 wherein selecting a mining 
algorithm appropriate for solving the business problem 
further comprises selecting a tree classification 
algorithm. 
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^4. The method of claim 1 wherein selecting a mining 
algorithm appropriate for solving the business problem 
further comprises selecting a neural classification 
5 algorithm. 

15. Thfe method of claim 1 wherein selecting a mining 
algorithm, appropriate for solving the business problem 
further comprises selecting an associations algorithm. 

10 



16. The metho\l of claim 1 wherein defining data schema 
for the mining algorithm further comprises the steps of: 



fU 
ft 



15 selecting from historical data for inclusion in 

input data schema predictor fields capable of supporting 
the use of a data miking algorithm in predicting the 
value of a prediction\f ield; and 



20 selecting for inclusion in output data schema at 

least one prediction fielc 

17 . The method of claim 1 ^herein defining data schema 
for the mining algorithm farther comprises selecting 
25 for inclusion in output schema sufficient key fields to 
comprise a unique key for identification in production 
data of storage locations for\the output data from the 
data mining algorithm. 



30 18. The method of claim 1 wherein defining a data mining 
model based on the data schema further comprises 
establishing in a data structure comprising the data 
mining model definition values for fields defining the 
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model . 

19 \ The method of claim 18 wherein fields defining the 
mode\{. comprise: 

field representing the number of consecutive 
record^ to select from the input data schema to be used 
for trailing; 

a fielfi representing the number of consecutive 
10 records to sedect from the input data schema to be used 
for development scoring; 



fU 
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15 



a field limiting the number of times the data mining 
model goes througt\ its input data in training mode; 

a field limiting^ the number of fitting centers 
created by the mining, data mining algorithm at each pass 
through the input data;> 



20 a field indicating t\e minimum number of records to 

be assigned to a region; 



25 



And 



a field identifying at lWst one predictor field; 



a field identifying a prediction field. 



20. The method of claim 1 whereinVdef ining a data mining 
model based on the data schema further comprises the 
30 steps of: 



establishing in a data structure comprising the data 
mining model definition values for fielop defining the 
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m&del; and 

\development scoring historical data wherein test 
output data is created; and 
5 \ 

tes\ing the test output data for accuracy wherein an 
accuracy valuation is created; 

wherein \the steps of establishing definition values, 
10 development scoring, and testing are repeated until the 
accuracy valuation meets a predetermined accuracy 
requirement. \ 

21. A system for developing a domain-specific analytic 
15 application having auv least one predefined data mining 
model, the system compVising: 

means for identifying a business problem to be 
Solved; 

20 

means for selecting a d^Sta mining algorithm 
appropriate for solving the business problem; 

means for defining data schema for use as inputs and 
25 outputs to and from the mining algorithm, the data schema 
including input data schema and output data schema; and 

means for defining a data miningXmodel dependent 
upon the data schema, wherein use of the means for 
30 defining a data mining model results in\:reation of a 
predefined data mining model; 



wherein use of the said means for identifying a 
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business problem, means for selecting a data mining 
aPaorithm, means for defining data schema, and means for 
defining a data mining model results in development of a 
domain-specific analytic application, the analytic 
application having at least one predefined data mining 
model . \ 

22. The system of claim 21 wherein the analytic 
applicationXcomprises the capabilities of: 

production training the predefined data mining model 
using the historical data in the input data schema, 
wherein use of the capability of production training the 
predefined data mining model results in creation of a 
production trainedXdata mining model; and 

production scor:mg production data by use of the 
production trained datai mining model. 

23. The system of claim ^2 wherein the capability of 
production training the data mining model further 
comprises the capability of ^operating the data mining 
model in training mode using >end user historical data in 
the input data schema. \ 

24. The system of claim 22 wherein the capability of 
production scoring production data^ay use of the 
production trained data mining model\ further comprises 
the capability of applying the production trained data 
mining model to historical data stored\in input schema. 

25. The system of claim 22 wherein the capability of 
production scoring production data by use of the 
production trained data mining model f urtheAcomprises 
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trie capability of applying the production trained 
daoamining model to production data stored read from an 
end \iser ' s production database . 

5 26. The system of claim 21 wherein the analytic 
application further comprises the capability of 
populating the input data schema with historical data. 
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27. The system of claim 26 wherein the capability of 
10 populating th«e input data schema with historical data 

further comprises the capabilities of extracting from 
historical data Values of prediction data fields and 
writing the values of the prediction data fields into the 
input data schema \or the data mining model 

15 

28. The system of clSaim 21 wherein means for identifying 
a business problem to r>e solved further comprises means 
for identifying a business problem capable of expression 
through the use of referents that are defined in a 

20 specific computational domain. 



29. The system of claim 21 Mierein means for selecting a 
mining algorithm appropriate for solving the business 
problem further comprises means\for selecting a radial 

25 basis function algorithm for val^e prediction. 

30. The system of claim 21wherein Vneans for selecting a 
mining algorithm appropriate for solving the business 
problem further comprises means for selecting a neural 

30 value prediction algorithm. 



31. The system of claim 21wherein means\for selecting a 
mining algorithm appropriate for solving \he business 
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aroblem further comprises means for selecting a 
demographic clustering algorithm. 

32. \The system of claim 21wherein means for selecting a 
5 mininV algorithm appropriate for solving the business 
problem further comprises means for selecting a neural 
clustering algorithm. 
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10 33. The si^stem of claim 21wherein means for selecting a 
mining algorithm appropriate for solving the business 
problem furthW comprises means for selecting a tree 
classif ication\algorithm. 

15 34. The system df claim 21wherein means for selecting a 
mining algorithm appropriate for solving the business 
problem further comprises means for selecting a neural 
classification algorithm. 

20 35. The system of claifri 21wherein means for selecting a 
mining algorithm appropriate for solving the business 
problem further comprises\means for selecting. an 
associations algorithm. 

25 36. The system of claim 21wh)erein means for defining 
data schema for the mining algorithm further comprises: 

means for selecting from historical data for 
inclusion in input data schema predictor fields capable 
30 of supporting the use of a data mining algorithm in 
predicting the value of a prediction^ f ield; and 

means for selecting for inclusion \in output data 
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schema at least one prediction field. 

37. \The system of claim 21wherein means for defining 
data sthema for the mining algorithm further comprises 
means for selecting for inclusion in output schema 
sufficient key fields to comprise a unique key for 
identification in production data of storage locations 
for the output data from the data mining algorithm. 

38. The systems of claim 21wherein means for defining a 
data mining mode\ based on the data schema further 
comprises means for establishing in a data structure 
comprising the data\mining model definition values for 
fields defining the model. 

39. The system of claimv38 wherein fields defining the 
model comprise: \ 

a field representing tnte number of consecutive 
records to select from the input data schema to be used 
for training; \ 

a field representing the number of consecutive 
records to select from the input data schema to be used 
for development scoring; \ 

a field limiting the number of tasmes the data mining 
model goes through its input data in training mode; 

a field limiting the number of fitting centers 
created by the mining data mining algorithm at each pass 
through the input data; \ 

a field indicating the minimum number of Vecords to 



49 



AUS920010258US1 



# 

Patent Application 



ne assigned to a region; 

\ a field identifying at least one predictor field; 
and \ 

a\field identifying a prediction field. 

40. The Wstem of claim 21wherein means for defining a 
data mining^ model based on the data schema further 
Comprises: \ 

means for ^establishing in a data structure 
comprising the da.ta mining model definition values for 
fields defining tne model; and 

means for development scoring historical data 
wherein test output data is created; and 

means for testing tP^e test output data for accuracy 
wherein an accuracy valuation is created; 
wherein the means for establishing definition values, 
means for development scoring, and means for testing are 
capable of repeated use until Vhe accuracy valuation 
meets a predetermined accuracy ^requirement . 

41. A computer program product for developing a 
domain-specific analytic applicatioia having at least one 
predefined data mining model, the computer program 
product comprising: \ 

a recording medium; \ 

means, recorded on the recording medium, for 
identifying a business problem to be solvedV 
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means, recorded on the recording medium, for 
sleeting a data mining algorithm appropriate for solving 
tnte business problem; 
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leans, recorded on the recording medium, for 
defining data schema for use as inputs and outputs to and 
from the\mining algorithm, the data schema including 
input datk schema and output data schema; and 



means, f^corded on the recording medium, for 
defining a dat^a mining model dependent upon the data 
schema, wherein\use of the means for defining a data 
mining model results in creation of a predefined data 
15 mining model; 

wherein use of tW said means for identifying a 
business problem, meansyfor selecting a data mining 
algorithm, .means for defining data schema, and means for 
20 defining a data mining model results in development 

of a domain-specific analytYc application, the analytic 
application having at least o^ie predefined data mining 
model . 



25 42. The computer program product\of claim 21 wherein the 
analytic application comprises the\capabilities of: 



production training the predefined data mining model 
using the historical data in the input xiata schema, 
30 wherein use of the capability of production training the 
predefined data mining model results in creation of a 
production trained data mining model; and 
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production scoring production data by use of the 
production trained data mining model. 

43. Vhe computer program product of claim 22 wherein the 
5 capability of production training the data mining model 
further ^comprises the capability of operating the data 
mining moctel in training mode using end user historical 
data in thev input data schema. 



a 
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10 44. The computer program product of claim 22 wherein the 
capability of production scoring production data by use 
of the production^ trained data mining model further 
comprises the capability of applying the production 
trained data mining\nodel to historical data stored in 

15 input schema. 



to 



20 



45. The computer program product of claim 22 wherein the 
capability of production scoring production data by use 
of the production trained data mining model further 
comprises the capability of applyi n 9 the production 
trained data mining model to production data stored read 
from an end user's production database. 



46. The computer program product \f claim 21 wherein the 
25 analytic application further comprises the capability of 
populating the input data schema witnv historical data. 



47. The computer program product of clkim 26 wherein the 
capability of populating the input data schema with 
30 historical data further comprises the capabilities of 

extracting from historical data values of prediction data 
fields and writing the values of the predictrpn data 
fields into the input data schema for the data\minmg 
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48. \ The computer program product of claim 21 wherein 
meank for identifying a business problem to be solved 
5 further comprises means, recorded on the recording 
medium A for identifying a business problem capable of 
expression! through the use of referents that are defined 
in a specific computational domain. 

10 49. The computer program product of claim 21 wherein 
means for selecting a mining algorithm appropriate for 
solving the business problem further comprises means, 
3 recorded on the recording medium, for selecting a radial 

\fi basis function algorithm for value prediction. 

(jl 50. The computer prdgram product of claim 21 wherein 

^ means for selecting a raining algorithm appropriate for 

solving the business problem further comprises means, 
recorded on the recordingYmedium, for selecting a neural 
20 value prediction algorithm 

'il 51. The computer program product of claim 21wherein 

y h means for selecting a mining algorithm appropriate for 

solving the business problem further comprises means, 
25 recorded on the recording medium, \£or selecting a 

demographic clustering algorithm. 

52. The computer program product of cYaim 21wherein 
means for selecting a mining algorithm appropriate for 
30 solving the business problem further comprises means, 

recorded on the recording medium, for selecting a neural 
clustering algorithm. 
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5*3. The computer program product of claim 21wherein 
means for selecting a mining algorithm appropriate for 
solvsing the business problem further comprises means, 
recorded on the recording medium, for selecting a tree 
5 classification algorithm. 

54. The apmputer program product of claim 21wherein 
means for selecting a mining algorithm appropriate for 
solving the misiness problem further comprises means, 
10 recorded on tne recording medium, for selecting a neural 
classification algorithm. 

q 55. The computer program product of claim 21wherein 

\D means for selecting V mining algorithm appropriate for 

15 solving the business problem further comprises means, 
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(J1 recorded on the recording medium, for selecting an 



associations algorithm. 

56. The computer program product of claim 21wherein 
20 means for defining data schema for the mining algorithm 

further comprises: 

means, recorded on the recoiling medium, for 
selecting from historical data for\ inclusion in input 
25 data schema predictor fields capable of supporting the 

use of a data mining algorithm in predicting the value of 
a prediction field; and 

means, recorded on the recording medium, for 
30 selecting for inclusion in output data schema at least 
one prediction field. 

57. The computer program product of claim 23\wherein 
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nteans for defining data schema for the mining algorithm 
fur\ther comprises means, recorded on the recording 
medi^im/ for selecting for inclusion in output schema 
sufficient key fields to comprise a unique key for 
identification in production data of storage locations 
for theXoutput data from the data mining algorithm. 
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58. The computer program product of claim 21wherein 
means for defining a data mining model based on the data 
10 schema further comprises means, recorded on the recording 
medium, for establishing in a data structure comprising 
the data mining model definition values for fields 
defining the model. 

15 59. The computer\program product of claim 38 wherein 
fields defining the model comprise: 

a field representing the number of consecutive 
records to select f ron\ the input data schema to be used 
20 for training; 



25 



a field representing\the number of consecutive 
records to select from the\input data schema to be used 
for development scoring; 

a field limiting the numfc)er of times the data mining 
model goes through its input data in training mode; 



a field limiting the number of fitting centers 
30 created by the mining data mining \algorithm at each pass 
through the input data; 



a field indicating the minimum nuhjber of records to 
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be assigned to a region; 

\ a field identifying at least one predictor field; 
and \ 

a field identifying a prediction field. 

60. The. computer program product of claim 2 wherein 
means foA defining a data mining model based on the data 
schema further comprises: 

means, recorded on the recording medium, for 
establishing \n a data structure comprising the data 
mining model definition values for fields defining the 
model; and \ 

means, recorded on the recording medium, for 
development scoringv historical data wherein test output 
data is created; ancK 

means, recorded on the recording medium, for testing 
the test output data foic accuracy wherein an accuracy 
valuation is created; \ 

wherein the means for Establishing definition 
values, means for development scoring, and means for 
testing are capable of repeated use until the accuracy 
valuation meets a predetermined accuracy requirement. 
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